Thermovibrio ammonificans type strain HB-1ᵀ is a thermophilic (Topt: 75 °C), strictly
anaerobic, chemolithoautotrophic bacterium that was isolated from an active, high-temperature
deep-sea hydrothermal vent on the East Pacific Rise. This organism grows on mineral salts
medium in the presence of CO₂/H₂, using NO₃⁻ or S⁰ as electron acceptors, which are
reduced to ammonium or hydrogen sulfide, respectively. T. ammonificans is one of only three
species within the genus Thermovibrio, a member of the family Desulfurobacteriaceae, and it
forms a deep branch within the phylum Aquificae. Here we report the main features of the
genome of T. ammonificans strain HB-1ᵀ (DSM 15698ᵀ).



Introduction 

The genus Thermovibrio consists of three validly published, named species:
T. ammonificans strain HB-1ᵀ [1], T. ruber strain ED11/3LLKᵀ [2] and
T. guaymasensis strain SL19ᵀ [3]. All three Thermovibrio spp. are anaerobic,
chemolithoautotrophic bacteria that grow on mineral salts in the presence of
carbon dioxide and hydrogen, reducing nitrate or sulfur to ammonium or hydrogen
sulfide, respectively. T. ammonificans was isolated from an active
high-temperature deep-sea hydrothermal vent located on the East Pacific Rise at
9° North, while T. ruber was isolated from shallow-water hydrothermal vent
sediments in Papua New Guinea and T. guaymasensis from a deep-sea hydrothermal
vent chimney in the Guaymas Basin [1-3]. Anaerobic chemolithoautotrophic
bacteria mediate the transfer of energy and carbon from a geothermal source to
the higher trophic levels. These anaerobic primary producers, which depend on
inorganic chemical species of geothermal origin (i.e., carbon dioxide, hydrogen
and sulfur), are completely independent of photosynthetic processes and
represent an important component of the deep-sea hydrothermal vent ecosystem.
Furthermore, microorganisms such as T. ammonificans, which also couple
autotrophic carbon dioxide fixation with nitrate respiration, are of particular
interest, as they link the carbon and nitrogen cycles, the latter of which has
been understudied at deep-sea hydrothermal vents. Here we present a summary of
the features of T. ammonificans strain HB-1ᵀ and a description of its genome.
Classification and features

Thermovibrio ammonificans strain HB-1ᵀ (= DSM 15698ᵀ = JCM 12110ᵀ) is a member
of the phylum Aquificae, a group of thermophilic, deeply branching bacteria
thought to be among the oldest on Earth. The phylum Aquificae consists of a
single order, the Aquificales, which is composed of three families,
Aquificaceae, Hydrogenothermaceae and Desulfurobacteriaceae (Figure 1). The
genus Thermovibrio belongs to the family Desulfurobacteriaceae, which also
includes the genera Desulfurobacterium, Balnearium and the newly described
Phorcysia [6-8]. While the genomes of several members of the families
Aquificaceae and Hydrogenothermaceae have been sequenced, the only genome
sequences publicly available for the Desulfurobacteriaceae are those of
T. ammonificans and Desulfurobacterium thermolithotrophum [9].

Table 1 summarizes the classification and general features of Thermovibrio
ammonificans HB-1ᵀ. Cells of T. ammonificans are Gram-negative, motile rods of
about 1.0 µm in length and 0.6 µm in width (Figure 2). Growth occurs between 60
and 80 °C (optimum at 75 °C), 0.5 and 4.5% (w/v) sodium chloride (optimum at
2%) and pH 5 and 7 (optimum at 5.5). Generation time under optimal conditions
is 1.5 h. Growth occurs under chemolithoautotrophic conditions in the presence
of hydrogen and carbon dioxide, with nitrate or sulfur as the electron acceptor
and with concomitant formation of ammonium or hydrogen sulfide, respectively.
Thiosulfate, sulfite and oxygen are not used as electron acceptors. Acetate,
formate, lactate and yeast extract inhibit growth. No chemoorganoheterotrophic
growth was observed on peptone, tryptone or Casamino acids. The genomic DNA G+C
content is 52.1 mol% [1].

Chemotaxonomy 

None of the classical chemotaxonomic features (peptidoglycan structure, cell
wall sugars, cellular fatty acid profile, respiratory quinones, or polar
lipids) are known for Thermovibrio ammonificans strain HB-1ᵀ.

Genome sequencing information 

Genome project history 

T. ammonificans was selected for genome sequencing because of its phylogenetic
position within the phylum Aquificae and because of its ecological function as
a primary producer at deep-sea hydrothermal vents. Sequencing, finishing and
annotation were carried out by the US DOE Joint Genome Institute (JGI). Table 2
shows a summary of the project information and its association with MIGS
version 2.0 compliance [17].

Growth conditions and DNA isolation 

T. ammonificans was grown in two liters of modified SME medium at 75 °C under a
H₂/CO₂ gas phase (80:20; 200 kPa) with CO₂ as the carbon source and nitrate as
the electron acceptor [1]. Genomic DNA was isolated from 0.5-1 g of pelleted
cells using a protocol that included a lysozyme/SDS lysis step, followed by two
extractions with phenol:chloroform:isoamyl alcohol (50:49:1) and ethanol
precipitation. This procedure yielded about 25 µg of genomic DNA, which was
submitted to the DOE JGI for sequencing.

Genome sequencing and assembly 

The genome of Thermovibrio ammonificans was sequenced at the DOE JGI [18] using
a combination of Illumina [19] and 454 platforms [20]. The following libraries
were used: 1) an Illumina GAii shotgun library, which generated 102,555,615
reads totaling 7,794 Mb; 2) a 454 Titanium standard library, which generated
186,945 reads; and 3) a paired-end 454 library with an average insert size of
11.895 ± 2.973 kb, which generated 115,495 reads totaling 104.7 Mb of 454 data.
All general aspects of library construction and sequencing performed at the JGI
can be found at the JGI website [21]. The initial draft assembly contained 16
contigs in 2 scaffolds. The 454 Titanium standard data and the 454 paired-end
data were assembled together with Newbler, version 2.3. The Newbler consensus
sequences were computationally shredded into 2 kb overlapping fake reads
(shreds). Illumina sequencing data were assembled with VELVET, version 0.7.63
[22], and the consensus sequences were computationally shredded into 1.5 kb
overlapping fake reads (shreds). The 454 Newbler consensus shreds, the Illumina
VELVET consensus shreds and the read pairs in the 454 paired-end library were
integrated using parallel phrap, version SPS - 4.24 (High Performance Software,
LLC). The software Consed [23] was used in the finishing process. Illumina data
were used to correct potential base errors and increase consensus quality using
the software Polisher developed at JGI (Alla Lapidus, unpublished). Possible
mis-assemblies were corrected using gapResolution (Cliff Han, unpublished),
Dupfinisher [24], or sequencing cloned bridging PCR fragments with subcloning.
Gaps between contigs were closed by editing in Consed, by PCR and by Bubble PCR
(J-F Cheng, unpublished) primer walks. A total of 46 additional reactions and 1
shatter library were necessary to close gaps and to raise the quality of the
finished sequence. The total size of the genome is 1,759,526 bp (chromosome and
plasmid) and the final assembly is based on 67.7 Mb of 454 draft data, which
provide an average 40× coverage of the genome, and 7,284 Mb of Illumina draft
data, which provide an average 4,285× coverage of the genome.
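
The shredding of consensus sequences into overlapping fake reads described above can be illustrated with a short sketch. It is not the JGI pipeline itself. The shred lengths (2 kb and 1.5 kb) come from the text, whereas the distance between consecutive shreds, and hence the overlap, is not stated; it appears here as a hypothetical `step` parameter, and the function and variable names are likewise assumptions.

```python
def shred(consensus: str, shred_len: int, step: int) -> list[str]:
    """Cut a consensus sequence into overlapping fixed-length pieces.

    `step` is the distance between consecutive shred starts, so the overlap
    between neighbours is shred_len - step. The step is an illustrative
    assumption; the source gives only the shred lengths.
    """
    if shred_len <= 0 or not 0 < step <= shred_len:
        raise ValueError("need shred_len > 0 and 0 < step <= shred_len")
    if not consensus:
        return []
    if len(consensus) <= shred_len:
        return [consensus]  # too short to shred; keep it whole
    starts = list(range(0, len(consensus) - shred_len + 1, step))
    # Add a final shred flush with the end so the tail is never dropped.
    if starts[-1] + shred_len < len(consensus):
        starts.append(len(consensus) - shred_len)
    return [consensus[s:s + shred_len] for s in starts]


if __name__ == "__main__":
    toy_contig = "ACGT" * 1375  # 5,500 bases of toy sequence, not real data
    pieces = shred(toy_contig, shred_len=2000, step=1000)  # step is assumed
    print(len(pieces), {len(p) for p in pieces})
```

Adding a last shred flush with the contig end is one way to make sure every consensus base is represented in at least one fake read; other tools may handle the tail differently.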
[Figure 1: neighbor-joining tree, image not reproduced. Taxa shown, with GenBank
accession numbers: Hydrogenobacter thermophilus (Z30189), Thermocrinis ruber
(NR_041946), Aquifex pyrophilus (M83548), Hydrogenivirga okinawensis
(NR_041333), Hydrogenobaculum acidophilum (D16296.1), Sulfurihydrogenibium
azorense (AF528192), Venenivibrio stagnispumantis (NR_044029),
Hydrogenothermus marinus (AJ292525), Persephonella marina (AF188332),
Thermosulfidibacter takaii (NR_041547), Phorcysia thermohydrogeniphila
(HQ336972), Balnearium lithotrophicum (NR_028647), Thermovibrio ammonificans
(AY263403), Thermovibrio guaymasensis (AY268937), Thermovibrio ruber
(AJ316619), Desulfurobacterium atlanticum (AY268939), Desulfurobacterium
pacificum (AY268936), Desulfurobacterium thermolithotrophum (AJ001049),
Desulfurobacterium crinifex (AJ507320).]

Figure 1. Phylogenetic position of Thermovibrio ammonificans HB-1ᵀ relative to other type strains within the
Aquificae. Sequences were aligned automatically using CLUSTAL X and the alignment was manually refined
using SEAVIEW [4,5]. The neighbor-joining tree was constructed with Phylo_Win, using the Jukes-Cantor
correction [4]. Bootstrap values are based on 100 replications. Bar, 0.02 substitutions per nucleotide position.
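
For readers unfamiliar with the Jukes-Cantor correction named in the Figure 1 caption, the sketch below computes the corrected distance d = -(3/4) ln(1 - 4p/3) from the proportion p of differing sites in a pairwise alignment. It illustrates the standard formula only and is not Phylo_Win's implementation; the function name, the gap handling and the toy sequences are assumptions.

```python
import math


def jukes_cantor_distance(seq_a: str, seq_b: str) -> float:
    """Jukes-Cantor corrected distance between two aligned sequences.

    Columns with a gap ('-') in either sequence are skipped, which is one
    common convention and an assumption of this sketch.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(a, b) for a, b in zip(seq_a.upper(), seq_b.upper())
             if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    # Observed proportion of sites that differ.
    p = sum(a != b for a, b in pairs) / len(pairs)
    if p >= 0.75:
        return math.inf  # the correction is undefined at saturation
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


if __name__ == "__main__":
    # Toy 20-site alignment with 2 mismatches (p = 0.10); not real 16S data.
    a = "ACGTACGTACGTACGTACGT"
    b = "ACGTACGAACGTACGTACCT"
    print(round(jukes_cantor_distance(a, b), 4))
```

The corrected distance is always at least as large as the raw proportion of differences, because the model accounts for multiple substitutions at the same site.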
Figure 2. Electron micrograph of a platinum shadowed cell of Thermovibrio
ammonificans strain HB-1ᵀ showing multiple flagella. Bar, 1 µm.



Table 1. Classification and general features of Thermovibrio ammonificans HB-1ᵀ

MIGS ID    Property                     Term                                        Evidence code
           Current classification       Domain Bacteria                             TAS [10]
                                        Phylum 'Aquificae'                          TAS [11]
                                        Class Aquificae                             TAS [12,13]
                                        Order Aquificales                           TAS [illegible]
                                        Family Desulfurobacteriaceae                TAS [15]
                                        Genus Thermovibrio                          TAS [2]
                                        Species Thermovibrio ammonificans           TAS [1]
                                        Type strain HB-1ᵀ
           Gram stain                   Negative                                    TAS [1]
           Cell shape                   Short rod                                   TAS [1]
           Motility                     motile                                      TAS [1]
           Sporulation                  non-sporulating                             TAS [1]
           Temperature range            60-80 °C                                    TAS [1]
           Optimum temperature          75 °C                                       TAS [1]
           Carbon source                CO₂                                         TAS [1]
           Energy source                H₂                                          TAS [1]
           Terminal electron acceptor   NO₃⁻, S⁰                                    TAS [1]
MIGS-6     Habitat                      Marine, deep-sea hydrothermal vent          TAS [1]
MIGS-6.3   Salinity                     20 g NaCl l⁻¹ (range 5-45 g NaCl l⁻¹)       TAS [1]
MIGS-22    Oxygen                       Anaerobe                                    TAS [1]
MIGS-15    Biotic relationship          free-living                                 TAS [1]
MIGS-14    Pathogenicity                Not pathogenic                              NAS
MIGS-4     Geographic location          East Pacific Rise                           TAS [1]
MIGS-5     Sample collection time       April 2000                                  TAS [1]
MIGS-4.1   Latitude                     9° 50' N                                    TAS [1]
MIGS-4.2   Longitude                    104° 18' W                                  TAS [1]
MIGS-4.3   Depth                        2500 m                                      TAS [1]
MIGS-4.4   Altitude                     not applicable

Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report
exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living,
isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These
evidence codes are from the Gene Ontology project [16].
Table 2. Project information

MIGS ID     Property                  Term
MIGS-31     Finishing quality         Finished
MIGS-28     Libraries used            454 pyrosequence standard library, 454 Paired End, Illumina
MIGS-29     Sequencing platforms      454 GS FLX Titanium and Illumina GAii
MIGS-31.2   Fold coverage             4,325×
MIGS-30     Assemblers                Newbler 2.3, Velvet 0.7.63
MIGS-32     Gene calling method       Prodigal 1.4
            Genome Database release   January 7, 2011
            Genbank ID                NC_014926
            GOLD ID                   Gc01577
            Project relevance         Chemosynthetic ecosystems, CO₂ fixation, Thermophiles
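
The fold-coverage entry in Table 2 can be related to the figures quoted in the text with simple arithmetic. The sketch below adds the per-platform coverages reported in the text (40× for 454 and 4,285× for Illumina) and, separately, computes a naive estimate as sequenced bases divided by genome size. The naive estimates need not match the reported values, which may have been derived differently; the function and variable names are illustrative.

```python
GENOME_SIZE_BP = 1_759_526  # chromosome plus plasmid, from the text


def naive_fold_coverage(data_mb: float, genome_size_bp: int) -> float:
    """Sequenced bases divided by genome size (1 Mb taken as 10**6 bases)."""
    return data_mb * 1_000_000 / genome_size_bp


if __name__ == "__main__":
    # Reported per-platform coverages add up to the Table 2 entry.
    reported = {"454": 40, "Illumina": 4_285}
    print("sum of reported coverages:", sum(reported.values()))

    # Naive estimates from the draft data volumes quoted in the text.
    for platform, mb in {"454": 67.7, "Illumina": 7_284}.items():
        print(platform, round(naive_fold_coverage(mb, GENOME_SIZE_BP), 1))
```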



Genome annotation 

Genes were identified using Prodigal [25] as part of the Oak Ridge National
Laboratory genome annotation pipeline, followed by a round of manual curation
using the JGI GenePRIMP pipeline [26]. The predicted CDSs were translated and
used to search the National Center for Biotechnology Information (NCBI)
nonredundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro
databases. These data sources were combined to assert a product description for
each predicted protein. Non-coding genes and miscellaneous features were
predicted using tRNAscan-SE [27], RNAmmer [28], Rfam [29], TMHMM [30], and
SignalP [31].



Genome properties 

The genome includes one circular chromosome and one plasmid, for a total size
of 1,759,526 bp (chromosome size: 1,682,965 bp; GC content: 52.13%). Of the
1,888 genes predicted from the genome, 1,831 are protein-coding genes. Of the
protein-coding genes, 1,279 were assigned to a putative function, with those
remaining annotated as hypothetical proteins. The properties and the statistics
of the genome are summarized in Figure 3 and Tables 3 and 4.
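
Several figures in this paragraph follow from one another, which the short sketch below makes explicit. All input values are taken from the text; the variable names are illustrative, and the percentage of functionally annotated genes is a computation from those inputs rather than a number stated in the source.

```python
total_size_bp = 1_759_526
chromosome_bp = 1_682_965
total_genes = 1_888
protein_coding = 1_831
with_function = 1_279

plasmid_bp = total_size_bp - chromosome_bp      # size of the single plasmid
rna_genes = total_genes - protein_coding        # genes that are not protein-coding
hypothetical = protein_coding - with_function   # annotated as hypothetical proteins


def percent(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100 * part / whole, 2)


if __name__ == "__main__":
    print("plasmid (bp):", plasmid_bp)
    print("RNA genes:", rna_genes)
    print("hypothetical proteins:", hypothetical)
    print("protein-coding share of genes (%):", percent(protein_coding, total_genes))
    print("share with putative function (%):", percent(with_function, protein_coding))
```

The derived plasmid size and RNA gene count agree with the corresponding rows of Table 3.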



Table 3. Genome statistics

Attribute                          Value        % of totalᵃ
Genome size (bp)                   1,759,526
DNA coding region (bp)             1,674,589    95.17%
DNA G+C content (bp)               917,237      52.13%
Chromosome (bp)                    1,682,965
Plasmid (bp)                       76,561
Total genes                        1888
RNA genes                          57           3.02%
Protein-coding genes               1831         96.98%
Genes in paralog clusters          2            0.11%
Genes assigned to COGs             1419         75.16%
Genes with signal peptides         535          28.34%
Genes with transmembrane helices   369          19.54%
Paralogous groups                  1            100%

ᵃ The total is based on either the size of the genome in base pairs or
the total number of protein coding genes in the annotated genome.
Table 4. Number of genes associated with the 25 general COG functional categories

Code   Value   %ageᵃ   Description
J      154     8.41    Translation, ribosomal structure and biogenesis
A      -       -       RNA processing and modification
K      70      3.82    Transcription
L      78      4.25    Replication, recombination and repair
B      3       0.16    Chromatin structure and dynamics
D      40      2.18    Cell cycle control, mitosis and meiosis
Y      -       -       Nuclear structure
V      33      1.80    Defense mechanisms
T      71      3.87    Signal transduction mechanisms
M      135     7.37    Cell wall/membrane biogenesis
N      70      3.82    Cell motility
Z      -       -       Cytoskeleton
W      -       -       Extracellular structures
U      67      3.66    Intracellular trafficking and secretion
O      92      5.02    Posttranslational modification, protein turnover, chaperones
C      158     8.63    Energy production and conversion
G      77      4.20    Carbohydrate transport and metabolism
E      155     8.46    Amino acid transport and metabolism
F      72      3.93    Nucleotide transport and metabolism
H      120     6.55    Coenzyme transport and metabolism
I      46      2.51    Lipid transport and metabolism
P      102     5.57    Inorganic ion transport and metabolism
Q      38      2.08    Secondary metabolites biosynthesis, transport and catabolism
R      220     12.02   General function prediction only
S      92      5.03    Function unknown
-      412     22.50   Not in COGs

ᵃ The total is based on the total number of protein coding genes in the annotated genome.



http://standardsingenomics.org 






Figure 3. Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG 
categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs cyan, rRNAs red, other RNAs blue), 
GC content, GC skew. 
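
The two innermost rings of such a map are typically computed in sliding windows along the sequence. The sketch below shows one common way to do this, with GC content taken as (G+C)/window length and GC skew as (G-C)/(G+C). The window size, step, function name and toy sequence are assumptions, and the source does not state how the values in Figure 3 were computed.

```python
def gc_windows(seq: str, window: int, step: int) -> list[tuple[int, float, float]]:
    """Return (start, GC fraction, GC skew) for each full window of `seq`.

    GC fraction = (G + C) / window; GC skew = (G - C) / (G + C),
    reported as 0.0 when a window contains no G or C.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    seq = seq.upper()
    results = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        g, c = chunk.count("G"), chunk.count("C")
        gc_fraction = (g + c) / window
        skew = (g - c) / (g + c) if (g + c) else 0.0
        results.append((start, gc_fraction, skew))
    return results


if __name__ == "__main__":
    toy = "GGGCATAT" * 4 + "CCCGATAT" * 4  # toy sequence, not genome data
    for start, gc, skew in gc_windows(toy, window=16, step=16):
        print(start, round(gc, 2), round(skew, 2))
```

In the toy sequence the GC fraction is constant while the skew changes sign halfway along, which is the kind of switch a GC skew ring makes visible on a circular chromosome.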